•  “  DKTTViW  X 

j  for  yuHlr  TiriwTsU.*: 


OPTIMAL  PLACEMENT  OF  IDENTICAL  RESOURCES 
IN  A  DISTRIBUTED  NETWORK + 

by 


Michael  J.  Fischer 
Nancy  D.  Griffeth 
Leo  J.  Guibas 
Nancy  A.  Lynch 


Technical  Report  //81-01-03 


S 


DTIC 

electe 

FEB  1  0  1981 


Accession  For 

NTIS  GRAM 
DTIC  TAB 
Unannounced 
Justification- 


□ 

□ 


By - 

Distribution/ 


Availability  Codes 


Dist 


A 


Avail  and/or 
Special 


^This  research  was  supported  in  part  by  the  National  Science  Foundation 
under  grants  MCS77-02474,  MCS77-15628,  MCS78-01698,  MCS80-03337,  U.S. 

Array  Research  Office  Contract  Number  DAAG29-79-C-0155 ,  and  the  Office 
of  Naval  Research  Contract  Numbers  N00014-79-C-0873  and  N00014-80-C-0221 . 


Apjcor^d  for  puhBc  r 
Dd»t?lbH?5oD  UnSmlkxi 


_ line:  lass  1  tied _ 

^tCuRlTy  CLASSIFICATION  OF  TmIS.  PAGE  fWTiart  Dm#  Entar*d> 

H  REPORT  DOCUMENTATION  PAGE 


WFTrW-W^' 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 
T  RECIPIENT’S  C  A T  A L O G~N U  M 0  E  R 


AUTHOR^; 


Michael  J ,  /Fischer f  Nancy  D./g 
Leo  J.  Guibas^a^»d-  Nancy  A ,/  Lyi 


Grif feth. 


Lynch 


9  PERFORMING  OHGANI  Z  ATION  NAME  AND  ADDRESS 

Department  of  Computer  Science,  FR-35 
University  of  Washington 


6  --PERFORMING  ORG.  REPORT  NUMBER 


8  CONTRACT  OR  GRANT  NUMBERS) 

ONR:  N00014-79-C-0873  &  NOOGlc 
80-C-0221:  NSF  MCS77-02474, 

MCS 77-15628,  MCS78-01 678.  MCSLO 
03337;  ARO:  DAAG29-79-C-0155. 

— I  o7  PROG  R  AM  EL  EM  E  N  T,  >  ROJECtT  TASK  " 
AREA  8  WORK  UNIT  NUMBERS 

NR  049-456/30  Oct  79  (437) 


tl  CONTROLLING  OFFICE  NAME  ANO  ADDRESS 


Office  of  Naval  Research,  800  N.  Ouincy,  Arl±»£lgrt  ,  Jann— y^81 
VA  22217.  ATTN:  Dr.  R.B.  Grafton  /  NSF,  Washing t (JA 7>wmber  or  pages 
Mi  ,20^30  AS,  Army,  Research  Off  j  -  "  " 


1 4  MONITORING  AGENCY  NAME  ft  ACJtJRESSfU  dilfarant  from  Controlling  Office)  IS.  SECURITY  CLASS,  (of  thla  raport) 

Office  of  Naval  Research  -  /  n.  u.arjo 

801  N.  Quincy  Unclassified 

Arlington,  VA  22217  {*  d  ^ _  15*  decl assif,c ation /down gr^c 

ATTN:  Dr.  R.  B.  Grafton _ j _ 

16.  DISTRIBUTION  STATEMENT  (of  thi*  Raport) 


Unclassified 

is#,  decl assifYcation/oowngraoing 
schedule 


Approved  for  public  release;  distributed  unlimited. 


TT  DISTRIBUTION  statement  (of  tha  abatract  an tfad  In  Block  20,  U  dlffmrant  from  Raport) 


18  supplementary  notes 


[  19  key  WOROS  (Conr/nu#  on  ravaraa  alda  it  nacaaamry  and  Idmnttfy  by  block  nuoi bar) 


Computer  network,  distributed  system,  efficient  placement  algorithm, 
optimality,  resource  allocation  problem. 


To"  ABSTRACT  (Contlnua  on  ravraa  alda  It  nacaaamry  and  IdantUy  by  block  numbar) 

"v  The  problem  is  considered  of  locating  a  number  of  identical  resources 
at  nodes  of  a  tree  so  as  to  minimize  the  total  expected  cost  of  servicing 
a  set  of  random  requests  for  the  resources.  The  cost  of  servicing  a  request 
is  the  tree  distance  from  the  requesting  node  to  the  node  at  which  the 
resource  satisfying  the  request  is  located.  An  algorithm  for  finding  an 
optimal  placement  of  t  resources  is  presented^jbhich  runs  in  time 

0(t^*|  E  |),  where  E  is  the  edge  set  of  the  tree.  In  the  special  case  of 


FORM 

I  JAN  73 


EDITION  OF  1  NOV  69  IS  OBSOLETE 
S/N  0102  L  F  014  6601 


Vnclassif led 

SECURITY  CLASSlVlCATlOlToF  THIS  P AOt fmarart) 


1 


OPTIMAL  PLACEMENT  OF  IDENTICAL  RESOURCES  IN  A  DISTRIBUTED  NETWORK' 


Michael  J.  Fischer  Leo  J.  Guibas 

University  of  Washington  Xerox  Palo  Alto  Research  Center 

Nancy  D.  Griff eth  Nancy  A.  Lynch 

Georgia  Institute  of  Technology  Georgia  Institute  of  Technology 


ABSTRACT 

The  problem  is  considered  of  locating  a 
number  of  identical  resources  at  nodes  of  a 
tree  so  as  to  minimize  the  total  expected  cost 
of  servicing  a  set  of  random  requests  for  the 
resources.  The  cost  of  servicing  a  request  is 
the  tree  distance  from  the  requesting  node  to 
the  node  at  which  the  resource  satisfying  the 
request  is  located.  An  algorithm  for  finding 
an  optimal  placement  of  t  resources  is  pre- 
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sented  which  runs  in  time  0(t  *|E|),  where  E 
is  the  edge  set  of  the  tree.  In  the  special 
case  of  a  complete  binary  tree  with  requests 
uniformly  distributed  over  the  n  leaves, 
another  algorithm  works  in  time  0(t  log2n) . 

While  optimal  placements  in  general  seem  diffi¬ 
cult  to  characterize,  there  is  a  very  simple 
placement  whose  total  cost  differs  from  opti¬ 
mality  by  at  most  |E|.  The  expected  cost  of 
this  placement  on  a  complete  binary  tree  is 
0(n  +  /tn) . 

1.  INTRODUCTION 

We  consider  the  problem  of  locating  some  num¬ 
ber  t  of  identical  resources  at  nodes  of  a  dis¬ 
tributed  network  in  such  a  way  as  to  minimize  the 
expected  "cost”  of  servicing  a  random  set  of  t 
requests  for  those  resources.  Various  different 
costs  are  likely  to  be  important  in  different 
situations . 

For  example,  suppose  the  resources  are  pro¬ 
cessors  and  requests  are  to  establish  a  virtual 
connection  with  a  processor  which  will  be  used  for 
a  very  long  period  of  time.  In  this  case  one 
might  want  to  minimize  the  expected  total  of  all 
the  network  distances  (measured  in  some  appropriate 
way)  between  requesting  users  and  their  assigned 
processors.  Because  of  the  long  holding  times,  it 
is  probably  reasonable  to  expend  considerable 
effort  to  find  a  good  matching  of  requests  to 

^This  research  was  supported  in  part  by  the 
National  Science  Foundation  under  grants  MCS77- 
02474,  MCS77-15628,  MCS78-01698,  MCS80-03337,  U.S. 
Army  Research  Office  Contract  Number  DAAG29-79-C- 
0155,  and  the  Office  of  Naval  Research  Contract 
Numbers  N00014-79-C-0873  and  N00014-80-C-0221 . 


resources.  The  total  of  the  network  distances 
provides  a  measure  of  the  expected  communicat ion 
traffic  introduced  into  the  network  by  the  compu¬ 
tations. 

For  another  example,  suppose  the  resources 
are  tickets  to  sporting  events  (or  airline  seats). 

A  relev«.at  cost  is  the  expected  waiting  time  until 
a  buyer  receives  his  ticket,  or  equivalently,  the 
expected  total  waiting  time  for  all  buyers.  In 
this  situation,  it  is  probably  not  reasonable  to 
expend  much  effort  in  attaining  a  near-optimal 
matching,  for  the  time  to  find  the  matching  can 
easily  exceed  the  eventual  savings  in  locating  a 
nearby  ticket.  Even  so,  the  expected  total  dis¬ 
tance  in  an  optimal  matching  is  significant  as  a 
lower  bound  for  the  expected  total  waiting  time. 

This  paper  investigates  properties  of  optimal 
resource  placements  and  provides  fast  algorithms 
for  finding  them.  It  also  provides  upper  bounds 
for  the  costs  of  optimal  placements  and  of  certain 
nearly-opt imal  and  easily-described  placements. 

The  network  in  all  cases  is  assumed  to  be  con¬ 
figured  as  a  tree.  Nevertheless,  the  upper  bounds 
can  often  be  applied  to  an  arbitrary  (connected) 
distributed  network  by  constructing  a  spanning 
tree . 

In  this  paper,  we  allow  for  the  possibility 
that  fractions  of  resources,  and  not  only  whole 
resources,  might  be  located  at  some  nodes  of  the 
network  tree.  This  is  reasonable  if,  for  instance, 
the  resources  are  large  blocks  of  available  data 
storage  space.  It  would  be  perfectly  permissable 
for  a  user  to  obtain  paris  of  his  needed  storage 
from  several  different  nodes.  On  the  other  hand, 
we  assume  that  the  requests  are  discrete  —  each 
request  is  for  one  unit  of  resource. 

In  Section  2,  we  present  our  notation,  defini¬ 
tions,  and  those  results  which  apply  to  arbitrary 
trees  and  arbitrary  probability  distributions  for 
arrivals  of  requests.  §2.1  defines  matchings  and 
relates  them  to  network  flows.  §2.2  defines  the 
expected  total  cost  of  a  placement  in  terms  of 
expected  total  flow.  Both  the  function  describing 
the  expected  flow  on  each  edge  and  the  function 
describing  the  minimum  possible  expected  total  flow 
for  any  subtree  are  of  a  particularly  simple 
form  —  they  are  unbounded,  convex,  piecewise 
linear  functions  on  the  nonnegative  reals,  with 
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all  singularities  at  integers.  This  immediately 
implies  (in  §2.3)  that  there  are  always  optimal 
resource  placements  consisting  of  whole  numbers  of 
resources  located  at  each  node;  it  is  never  neces¬ 
sary  to  place  fractions  of  resources  at  any  node 
in  order  to  achieve  an  optimal  placement.  An 
2 

algorithm  using  0(t  '//edges)  arithmetic  opera¬ 
tions  is  presented  which  always  finds  an  optimal 
whole  resource  placement.  This  is  the  fastest 
algorithm  we  have  which  is  completely  general. 
Section  4  contains  faster  algorithms  for  special 
cases. 

§2.4  considers  what  is  required  for  a  place¬ 
ment  to  optimize  the  expected  flow  over  any 
particular  edge  in  the  tree  —  i.e.  to  be  locally 
optimal.  Unfortunately,  it  is  not  possible  in 
general  to  obtain  a  single  placement  which  simul¬ 
taneously  optimizes  the  expected  flow  on  all  edges. 

A  fair  whole  placement  is  one  which  "almost 
optimizes"  the  flow  on  each  edge  —  that  is,  one 
in  which  the  number  of  resources  in  each  subtree 
is  either  the  mean  rounded  up  or  down.  It  is 
always  possible  (and  quite  easy)  to  obtain  a 
fair  whole  placement,  and  such  placements  are 
close  to  optimal. 

In  §2.5,  we  show  that  if  there  is  a  proba¬ 
bility  less  than  %  that  any  request  will  arrive  in 
a  particular  subtree,  then  it  is  always  bad  to 
place  even  a  very  small  fraction  of  a  resource 
anywhere  in  that  subtree. 

In  Section  3,  we  analyze  the  cost  of  an 
optimal  placement  for  the  case  of  a  complete  binary 
tree  of  n  leaves  but  with  an  arbitrary  probabi¬ 
lity  distribution  for  request  arrivals.  An 
arbitrary  fair  whole  resource  placement  has 
expected  total  cost  at  most  0(n  +  /t  /n) .  Note 
that  in  the  very  important  case  where  t  is 
roughly  proportional  to  n  (i.e.  the  number  of 
resources  in  the  network  is  proportional  to  the 
number  of  nodes) ,  that  the  expected  average  cost 
per  request  is  bounded  by  a  constant,  independent 
of  the  size  of  the  network.  This  situation  is 
very  different  from  the  case  of  centralizing  the 
resources,  where  the  average  cost  grows  propor¬ 
tionately  with  the  log  of  the  number  of  nodes  in 
the  network. 

2.  NOTATION,  DEFINITIONS  AND  GENERAL  RESULTS 
2.1*  Trees,  Matchings  and  Flows 

A  tree  T  *  (V,E)  is  an  undirected  acyclic 
graph,  where  V  is  the  vertex  (node)  set  and  E 
is  the  edge  set. 

Let  N  denote  the  natural  numbers,  including 
0,  and  let  R+  denote  the  nonnegative  reals. 

A  set  of  requests  is  described  by  a  function 

r:  V^R  such  that  r(v)  is  the  number  of 
requests  originating  at  node  v.  A  placement  of 

resources  is  described  by  a  function  s:  V >  R+ 


such  that  s(v)  is  the  number  of  resources  placed 
at  node  v.  We  always  assume  total(r)  *  total(s), 

where  for  any  g:  V  +  R  ,  total (g)  *  £  g(v). 

veV 

A  matching  is  a  function  m:  VxV-*R+.  m(u,v) 
gives  the  number  of  requests  at  node  u  which  are 
satisfied  by  resources  at  node  v.  m  is  exact 

for  r,s  if  for  all  u,veV,  £  m(u,w)  -  r(u)  and 

weV 

^  m(w,v)  *  s(v).  Thus,  an  exact  matching  gives  a 
weV 

complete  correspondence  between  requests  and 
resources.  Clearly,  an  exact  matching  exists 
whenever  total(r)  -  total(s). 

The  cost  of  a  matching  is  defined  to  be 

cost(m)  *  l  m(u,v) *d(u,v) 
u,veV 

where  d(u,v)  is  the  number  of  edges  in  the 
unique  path  from  u  to  v  in  T.  Let 

cost(r,s)  *  min  (cost(m)  !  m  is  an  exact 

matching  for  r,s}. 

A  matching  m  is  optimal  for  r,s  if  cost(m)  = 
cost (r ,s) . 

It  is  convenient  to  relate  a  matching  to  a 
flow  in  a  directed  graph.  Choose  a  node  w,  and 
direct  the  edges  of  the  tree  to  point  away  from  w. 
Corresponding  to  the  usual  way  of  drawing  trees 
with  the  root  at  the  top,  we  let  high(e)  be  the 
endpoint  of  edge  e  which  is  closest  to  w,  and 
low(e)  be  the  endpoint  farthest  from  w. 

In  the  remainder  of  this  paper,  it  will  be 
convenient  for  w  itself  to  have  an  edge  entering 
it.  Therefore,  we  add  a  new  node  v  to  serve  as 
the  root,  and  we  let  rootedge(T)  =  (v,w) ,  an  edge 
from  v  to  w.  Resources  will  never  be  placed 
on  v,  and  requests  will  never  originate  at  v; 
v  and  rootedge(T)  are  just  notational  conven¬ 
iences  . 

A  flow  is  a  function  f:  E  -►  R  which  des¬ 
cribes  the  "movement"  of  resources  from  their 
initial  placement  to  corresponding  requests.  A 
positive  value  of  f(e)  denotes  a  flow  along  the 
direction  of  the  edge  (i.e.  towards  the  leaves) 
whereas  a  negative  value  of  f(e)  denotes  a  flow 
towards  the  root.  A  flow  is  stable  for  r,s  if 
for  every  veV, 

r(v)  +  £  f(e)  ■  s(v)  +  J  f(e’)‘ 

e:high(e)=v  e':low(c)=v 

Thus,  at  every  node,  the  flow  out  equals  the  flow 
in . 

The  cost  of  a  flow  is  defined  to  be 

cost (f )  -  £  i  f (e) I . 

ecE 

Call  a  flow  f  optima  1  for  r,s  if  it  is  stable 
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for  r,s  and  has  minimal  cost  over  all  such  flows. 

The  following  theorem  formalizes  the  intui¬ 
tion  the  flows  correspond  to  resources  moving 
along  edges  and  that  in  an  optimal  matching, 
resources  do  not  move  both  directions  along  the 
same  edge.  We  omit  the  straightforward  but 
tedious  proof. 

Theorem  2,1,1.  Let  m  be  an  optimal  match¬ 
ing  and  f  an  optimal  flow  for  r,s.  Then 
cost(m)  *  cost(f).  Moreover,  if  either  m  or  f 
has  an  integral  range,  then  the  other  can  be 
chosen  so  also. 

The  removal  of  any  edge  e  of  a  tree  T 
splits  the  tree  into  two  disconnected  components: 
B(e),  the  nodes  "below"  e,  and  A(e) ,  the  nodes 
"above"  e.  B(e)  and  A(e)  are  defined  by: 

B(e)  *  {veV  |  v  is  in  the  subtree 
rooted  by  low(e) } 

A(e)  =  V  -  B(e)  . 

The  flow  along  e  in  any  flow  for  r,s  depends 
only  on  the  total  number  of  requests  and  the  total 
number  of  resources  in  B(e).  For  any  function 

g:  V -*■  R+  and  ecEt  let 

total(e,g)  *  l  g(v). 

vcB(e) 

If  total(e,r)  exceeds  total(e,s),  then  there 
are  more  requests  than  resources  in  the  subtree 
B(e),  so  the  excess  must  be  satisfied  by  resources 
flowing  into  B(e)  via  the  edge  e.  On  the  other 
hand,  if  total(e,s)  exceeds  total(e,r),  the 
excess  resources  in  B(e)  are  needed  by  requests 
in  A(e)  (since  total(r)  *  total(s)),  so  they 
must  flow  out  of  the  subtree  via  e.  By  our  con¬ 
vention  that  the  sign  of  the  flow  denotes  its 
direction,  the  directed  flow  in  either  case  is 
given  by 

dflow  (e)  *  total(e.r)  -  total(e,s). 
r ,  s 

Theorem  2.1.2.  dflow  is  the  unique 

r ,  s 

stable  flow  for  r,s. 

Proof .  The  argument  sketched  above  shows 

that  there  is  at  most  one  stable  flow  for  r,s. 

We  leave  to  the  reader  to  show  that  dflow  is 

r  ,s 

stable  for  r,s.  f] 

Theorems  2.1,1  and  2.1.2  immediately  yield: 


distributions  for  sets  of  requests  and  define  costs 
of  placements  in  termr  ->f  their  expected  costs 
given  a  randomly  chosen  set  of  requests. 

If  is  a  probability  function  on  V  and 

teN,  then  a  random  r:  *  ♦  K+  can  be  chosen  accord¬ 
ing  to  a  probability  distribution  determined  as 
follows:  for  each  i  in  turn,  lii£t,  $  is 

used  to  select  a  vertex.  Then  r(v)  is  the  total 
number  of  times  v  is  selected  for  each  vcV.  We 
always  assume  $(root)  -  0. 

Fix  $ ,  t  as  above,  and  let  s:  V  -►  R*  with 
total (s)  *  t.  Then  expcogt (s)  denotes  the 
expected  value  of  cost(r,s),  where  r  is  chosen 
as  described  above,  and  minexpcost  = 

min  (expcost(s)  |  s:  V  -*■  R+  and  total  (s)  *  t}. 

The  flow  along  an  edge  e  depends  only  on 
total(e,s),  the  number  resources  in  the  subtree 
below  e,  and  not  on  their  particular  placement. 

Let  expf low^(u)  be  the  expected  value  of 

| dflow  (e)  |  ,  where  r  is  chosen  randomly  as 
r ,  s 

described  above,  and  s  is  any  placement  with 
total(e,s)  «  u. 

The  following  two  expansions  are  easy  to  see. 

Theorem  2,2.1.  Expcost(s)  * 

£  expflow  (total (e ,s) ) . 
e£E  e 

Theorem  2,2,2.  Expflow^(u)= 

l  (Sp1(l-p)t  i|u-i|,  where  p  -  £  <Kv) . 
i-0  veB(e) 

The  next  theorem  shows  that  the  expflow 
function  has  a  simple  form. 

Theorem  2.2.3.  For  any  fixed  <f,  til,  and 
e  e  E,  expflow^  is  an  unbounded,  convex,  piecewise 

linear  function  from  R+  to  R+,  with  all  singu¬ 
larities  occurring  at  integer  values. 

Proof .  By  Theorem  2.2.2,  expflow^u)  is  the 

sum  of  functions  g,(u)  =  k.*|u-i|,  where  k. 

&i  l  1  1  *  i 

does  not  depend  on  u,  0  t,  i  l  t .  Each  g^  is  a 

convex,  piecewise  linear  function  from  R+  to  K+ 
with  a  singularity  at  u  *  i,  and  at  least  one  of 
the  *s  unbounded.  Since  addition  preserves 

all  four  required  properties,  the  result  follows.' 


Theorem  2.1.3.  cost(r,s)  *  cost(dflow^  ). 

.  r ,  s 

Thus,  our  original  problem  of  studying  opti¬ 
mal  exact  matchings  reduces  to  the  problem  of 
studying  a  particular  stable  flow. 

2.2.  Expected  Costs 


If  e e  E,  let  E  denote  the  set  consisting 
of  e,  together  with  all  edges  below  c  in  T, 
so  d  t  if  low(d)  t  B(e) .  Def ine 


expcost  (s) 


and 


J  expflow  .(total  (d,s) ) 
dcE 

e 


In  this  section,  we  introduce  probability 
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Thus,  expcoste(s)  gives  the  portion  of  the 

expected  cost  of  placement  s  due  to  flow  in  the 
subtree  below  (and  including)  e,  and 
minexpcoste(u)  gives  the  least  such  costt  sub¬ 
ject  to  the  constraint  that  exactly  u  resources 
be  placed  in  the  subtree. 

From  these  definitions  and  Theorem  2.2.1,  it 
should  be  clear  that 

expcoste(s)  *  expflowe(total(e,s) ) 

l 

+  J  expcost  (s) 
i-1  ei 

where  are  the  immediate  descendants  of 

edge  e.  Optimizing  over  all  s  with 
total(e,s)  *  u,  we  get 

Theorem  2.2.4.  Let  t,  and  e  be  fixed. 

Let  e . e.  be  the  immediate  descendant  edges 

1  *  + 
of  e,  and  let  ueR  .  Then 

minexpcoste (u)  * expf low^ (u) 

Z 

+  min{  J  minexpcost  (u.)|lu.^u}. 
i-l  ei  1 

In  order  to  obtain  more  information  about  the 
values  of  the  minexpcoste  function,  we  first 

show  that  it  has  a  simple  form. 

Theorem  2.2.5.  For  any  fixed  $ ,  t  £1,  and 
e,  minexpcost^  is  an  unbounded,  convex,  piece- 

wise  linear  function  from  R+  to  R+,  with  all 
singularities  occurring  at  integer  values. 

Proof .  We  use  induction  on  edges  in  the  tree, 
working  from  the  leaves  toward  the  root. 

If  e  is  a  lowest  edge,  then  there  is  only 
one  edge,  e,  in  Eg.  Thus,  minexpcost  * 

expflow  ,  which  has  the  needed  properties  by 
Theorem  2.2.3. 

Now  assume  the  result  holds  for  edges  below 
e,  and  let  e^,...,e^  denote  the  immediate 
descendant  edges  of  e.  Consider  the  expression 
for  minexpcoste(u)  given  in  Theorem  2.2.4.  The 
first  term  has  the  needed  properties  by  Theorem 
2.2.3,  It  remains  to  show  that  the  second  term  is 
convex,  piecewise  linear  and  has  all  singularities 
at  integers. 

Write  f  for  minexpcost  ,  g(u)  for 
1  ei 

min  (  Z  f  (u^)  |  Z  s u  }  . 

By  the  induction  hypothesis,  each  Is 

unbounded,  convex  and  piecewise  linear  with  all 
s inguJari t i es  at  integers.  Hence,  the  derivative 
f] (x)  is  defined  at  all  non-singular  points,  Wc 


define  f *(n)  for  n  a  singularity  of  f ^  to 
the  limit  of  f^(x)  as  x  approaches  n  from 
above.  By  convexity  and  linearity  of  f^,  is 

a  non-decreasing  step  function  over  the  domain  R+ 
with  steps  occurring  at  integer  points. 

For  each  i,  l<i<£,  let  r.  be  the  small- 
est  element  of  R  with  f*(r^)  *  0.  exists 

since  f ^  is  unbounded,  and  r^f N  by  the  proper¬ 
ties  of  f^.  We  consider  two  cases: 

Case  1.  u^Z  r^.  Then  g(u)  -  Z  f^r^,  so 
g  is  constant  for  all  such  u. 

Case  2.  u<E  Let  S  *  (f|(x)  | 

l<i<£,  x  £  [0,  r  )]},  For  ocS  and  lsi<£, 
define 

,  x°  *  gib  {z  |  f !(z)  £  o) 

y°  *  lub  {z  |  f | (z)  <  o) 

By  the  properties  of  f ^ ,  we  have  x^, 

y?£N,  x°^y.^r,,  and  f!(z)  =  o  iff  z  e 

i  i  i  1  i 

(x°.  y°) .  Hence,  the  intervals  [x^,  y^)  for 

ocS  partition  [0,  r  ,  so  the  intervals 

lo  *  ^xi*  for  ocS  Petition  [0,  Z  r.). 

Choose  o  e  S.  We  now  show  that  g  is  linear 
over  I  .  Let  uel  .  Choose  u  =  (u. . uj 

such  that  g(u)  *  l  f^Cu^)  and  Z  £  u.  By  the 
choice  of  u  and  using  the  facts  that  f|(z)  <o 
iff  z  <  x^  and  f^(z)  >  o  iff  z  >  y°,  it  is 
straightforward  to  show  that  x^  <  u^,  £  y^  and 

Z  «  u.  Hence,  f^u^  =  f ^ (x^^)  +  c  •  (Uj  -  )  . 

Summing  over  all  i,  we  get 
l 

g(u)  =  l  f  (u  ) 

1=1 

*  Z  f^x^)  +  ui  -  Z  x°> 

*  Z  f 1(x^)  +  o* (u  -  Z  x°>  , 

a  linear  function  of  u  as  desired. 

The  convexity  of  g  follows  from  the  mono- 
tinicitv  of  the  f|.  [! 

Examination  of  the  proof  of  Theorem  2.2.5 
allows  us  to  sharpen  Theorem  2.2.4  bv  stating  that 
the  minimum  cost  can  always  be  achieved  by  placing 
whole  resources  on  all  vertices. 
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Theorem  2.2.6.  Let  $>,  t,  e  be  fixed. 

Let  . be  the  immediate  descendant  edges 

of  e,  and  let  ueN.  Then  minexpcost  (u)  * 

£  e 

expflow  (u)  +  min  {  J  minexpcost  (u  )  jlu,  £u  and  u,  cN). 

i-1  ei  1  1  1 

2.3.  Optimal  Placement 

We  are  interested  in  determining  the  "best*1 
placement  functions  s:  V -+  R+  in  the  following 

sense.  We  say  s;  V -►  R+  is  optimal  for  <J>,  t 
provided  total(s)  -  t  and  expcost  (s)  *  minexpcost. 
The  first  characterization  result  follows  imme¬ 
diately  from  Theorem  2.2.6  and  shows  that  there 
are  optimal  s  which  take  on  integral  values  only. 

Theorem  2.3.1.  For  any  probability  function 
$  and  any  t  €  N,  there  exists  s:  V  +  N  which  is 
optimal  for  t. 

Proof .  Theorem  2.2.6  essentially  provides  an 
algorithm  for  producing  such  s.  For  any  edge  e 
of  T  with  immediate  descendants  e^,...^^,  and 

any  ueN,  one  determines  values  of  s  for  all 
nodes  below  e  by  considering  all  possible  decom¬ 
positions  with  Zu^^u  and  u^cN  for 

all  i.  For  each  such  decomposition,  one  recur¬ 
sively  determines  values  of  s  for  all  nodes 
below  each  e^,  and  corresponding  costs.  The 

decomposition  with  the  smallest  total  cost  is 
chosen.  □ 

In  order  to  analyze  the  cost  of  determining 
an  optimal  placement  as  above,  we  do  not  perform  a 
straightforward  recursive  analysis  of  the  algorithm 
described  in  the  proof  of  Theorem  2.3.1.  Rather, 
we  take  advantage  of  repeated  work  in  various 
recursive  calls.  During  the  algorithms,  one  must 
calculate  expflowe(u)  for  all  e  t  E  and 

all  u,  O^ust.  The  number  of  arithmetic  opera¬ 
tions  involved  in  one  calculation  of  expflowe(u) 

is  0(t)  (if  performed  judiciously),  independent 
of  e  and  u.  It  is  these  costs  which  dominate 
the  total  count  of  arithmetic  operations,  so  that 
2 

an  0 ( t  • J  E  | >  analysis  results.  We  summarize 
this  discussion  in  the  following  theorem. 

Theorem  2.3.2.  There  is  an  algorithm  using 

2  i 

0(t  •]  E  |)  arithmetic  operations  which,  for  any 
tree  T  =  (E,V)  probability  function  $ ,  and 
t  £  N,  determines  a  placement  of  whole  resources 
which  is  optimal  for  $,  t. 

2.4.  Optimizing  Flow  on  Individual  Edges 

In  this  section,  we  show  that  the  flow  on 
each  individual  edge  is  optimized  for  a  number 
equal  to  the  median  of  an  appropriate  binomial 
distribution.  We  use  this  to  bound  the  distance 
from  optimal  of  two  simple  placements. 


a  random  variable  whose  value  is  the  number  of  suc¬ 
cesses  in  n  independent  trials,  each  of  whose 
probability  of  success  is  p.  Define  median (n ,p) 

as  the  smallest  c  e  R+  such  that  Pr[xSc]  . 

Theorem  2,4.1.  Let  <p  be  a  probability 
function  on  V,  tcN  -  {0},  e  e  E.  Then 
expflow^Cu)  is  minimized  at  u  =  median(t,p), 

where  p  -  J  <J>(v). 
veB(e) 

Proof .  Write  f  for  expflow^.  By  Theorem 

2.2.3,  f  is  minimized  at  an  integer  u  which  is 
the  smallest  reN  with  f(r+l)  -  f(r)  nonnega¬ 
tive.  Now 

f(r+l)  -  f(r) 

■  I  (hp1(l-p)t_1|r  +  X  -  i | 
i-0 

-  I  (J)pi(X-p) t-i | r  -  i | 
i=0 

by  Theorem  2.2.2, 

“  I  (bpi(l-p)t  i(  |r+  1  -  i  |  -  |  r  -  i| ) 
i=0  1 

=  I  (hpid-p)t"1-  I  (bp^x-p/-1 

i=0  i=r+l 

-  2  l  (hpi(l-p)t_1-l 

i=0 

=  2Pr  [x  <  r]  -  1. 

Thus,  u  is  the  smallest  reN  with  2Pr[x£r]- 
1  nonnegative,  or  Pr[x  ^  r]  >^;  that  is,  u  * 
median (t,p). 

The  following  theorem  follows  immediately 
from  Theorem  3.2  and  Corallary  3.1  of  Jogdeo  and 
Samuels  [1],  It  is  also  implicit  in  some  earlier 
work  by  Uhlmann  [2,3]. 

Theorem  2.4.2  (Jogdeo  and  Samuels).  Let 
nf  N,  n  -  1 ,  Ospsl.  Then  median(n,p)  * 

{ [npj  ,  fnp"|  )  • 

Since  np  is  the  mean  number  of  successes  in 
n  independent  trials  with  success  probability  p, 
this  theorem  implies  that  the  mean  and  median 
differ  by  less  than  one. 

Thus,  each  edge  individually  has  its  flow 
optimized  at  a  value  which  is  either  the  mean 
rounded  up  or  the  mean  rounded  down.  However,  it 
is  not  always  possible  to  achieve  this  local 
optimum  consistently  throughout  the  tree.  In  fact, 
in  this  very  general  setting,  an  optimal  placement 
might  require  some  subtree  to  contain  a  value 
other  than  the  mean  rounded  up  or  down. 
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Let  iuN  -  (0),  0  '  p  s  1.  Let  x  be 


For  any  T,  $,  t,  and  for  each  edge  eeE, 

Z  + 

4>(v) .  s :  V  +  R  is  an  exact  fair- 

veB(e) 

share  placement  for  T,  t  if  for  each  eeE, 

total  (efs)  *  tp&.  s':  V  +  N  is  a  fair  whole 

placement  for  T,  $>,  t  if  for  each  eeE, 

LtpJ  £total(e,s)  £  ftpe"|. 

It  is  not  difficult  to  see  that  for  any  T, 
t,  an  exact  fair-share  placement  and  a  fair 
whole  placement  exist.  Let  s(v)  =  t*<£(v)  for 
v c  V.  Then  s  is  an  exact  fair-share  placement. 
Let  s'(v)  *  |_t*$(v)J  if  v  is  a  leaf.  Let 
s'(v)  =0  if  v  is  the  root.  For  v  not  a  leaf 
or  root,  let  e  be  the  edge  with  low(e)  *  v, 
and  let  en,...,e  be  its  immediate  descendants. 

1  *  l 

Let  s'(v)  *  [tpl  -  J  [tp  J.  An  easy  induction 
^  i-1  ei 

shows  that  s’  is  a  fair  whole  placement  for  T, 
t,  and  in  fact,  total(e,s’)  *  [tp^J  f°r 
every  eeE. 

In  the  special  case  that  4>  is  non-zero  only 
on  leaves,  then  there  is  a  fair  whole  placement 
s"  which  is  non-zero  only  on  leaves.  It  can  be 
obtained  by  a  simple  recursive  construction. 

Start  at  the  root  of  T  with  t  resources  to 
distribute.  Below  any  edge  e  in  the  tree,  we 
will  have  either  [tpgJ  or  ftp^  to  distribute. 

Assume  e  has  descendants  e^,...e^.  Distribute 

[tp.  J  or  ftp  1  to  the  i  subtree.  Since 
ei  ei 

l  Ltpe  J s  LtpJ  s  rtpel  *  l  Ttpe  1 

1-1  i  1=1  1 

the  distribution  is  possible.  We  then  proceed 
recursively  on  e^,...,e^, 

Example  2.4.1.  Let  T  be  a  32-leaf  complete 
binary  tree  (except  for  rootedge(T) ) ,  ^^^”256’ 

for  each  of  the  leftmost  16  leaves  I,  -jig-  for 

each  of  the  rightmost  16  leaves,  and  0  for  all 
other  nodes.  Let  t  *  16.  The  placement  s  which 
has  s(£)  *  1  for  each  of  the  leftmost  16  leaves 
l,  and  0  elsewhere,  has  total(e,s)  =  16  for  e 
the  left  descendant  of  rootedge(T).  However,  the 
mean  number  of  requests  in  the  subtree  below  e 

is  t  •  *  15,  so  s  is  not  a  fair  whole 

lb 

placement.  Nevertheless,  one  can  determine  by 
exhaustive  searching  that  s  is  optimal,  and 
expcost(s)  is  strictly  smaller  than  expcost(s') 
for  any  fair  whole  placement  s'. 

We  note  in  general,  however,  that  the  optimal 
cost  cannot  be  too  much  less  than  the  cost  of  any 
exact  fair-share  placement  or  fair  whole  placement. 

Theorem  2. A. 3.  Let  s  be  an  exact  fair- 
share  placement  or  a  fair  whole  placement  for  T, 
t.  Then  expcost(s)  sminexpcost  +  I  E  I. 


Proof .  By  the  results  of  this  section, 
^xpflowe  is  minimized  at  some  u,  where 

|_tPeJ  ^  u  <  [tPgl  •  Thus,  I  total  (e ,  s)  -  u  |  <  1 .  But 

then  |expflow^  (total (e ,s) )-  expflowe(u) |  £l,  by 

calculations  similar  to  those  in  the  proof  of 
Theorem  2.4.1.  Since  no  placement  can  do  better 
than  the  optimal  on  each  edge,  s  incurs  at  most 
an  extra  cost  of  1  per  edge.  J 

For  some  choices  of  T,  t  and  4> ,  of  course, 
it  is  possible  to  consistently  achieve  the  median 
on  each  edge  — for  example,  if  the  mean  is  an  integer 
for  every  edge.  But  in  general,  we  have  no  global 
optimality  results.  In  §4  we  obtain  optimal 
results  for  a  special  case. 

Example  2.4.2.  If  T  is  a  complete  binary 

k  k 

tree  with  leaf  vertices  L,  }  L  }  *=  2  ,  t  =  a*2 
for  integer  a,  and  <f>  the  uniform  distribution 
on  the  leaves, then  the  placement  s  such  that 
s(v)  =  a  for  each  veL  and  s(v)  =  0  for  veV  -  L 
is  optimal,  because  it  achieves  the  integer  mean  on 
each  edge. 

Example  2.4.3.  Let  T  be  the  tree  depicted 
below,  <}>  as  indicated  on  the  leaves  and  0 
elsewhere,  and  t  =  12. 


Then  the  means  are  indicated  on  each  edge  below, 
so  all  resources  can  be  placed  at  the  levels 
indicated  by  the  circled  numbers. 


i 
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2.5.  Nodes  with  Zero  ™acements 

We  conclude  this  section  with  a  somewhat  sur¬ 
prising  characterization  theorem.  It  says  that  if 
there  is  a  probability  less  than  4  of  any  request 
arriving  in  a  subtree,  thwu  it  is  bad  to  place 
even  a  small  fraction  of  a  resource  anywhere  in 
that  subtree. 

Theorem  2.5.1,  Let  eeE  satisfy  (l-p)*"  >  h* 

where  p  -  J  <£(v)  .  Let  s  be  optimal  for  4>, 
veB(e) 

t.  Then  s(v)  -  0  for  all  veB(e). 


Theorem  3.1.1.  Let  T,  <f>,  L,  n  be  as 

above.  Let  t  e  N,  til,  and  let  s :  V  R+  be  an 
exact  fair-share  placement  with  total(s)  *  t.  Let 

e  €  E  and  p  -  J  4>  ( v)  ,  m  *  I  L  r.  B  ( e )  ,  .  Then 
vcB(e) 

expcost^Cs)  s  ci^mtp^ 

Here  c  is  a  fixed  constant  independent  of  the 
choice  of  T,  $>,  t,  s,  or  e. 

Before  proving  the  theorem,  we  need  some 
technical  lemmas. 


Proof .  Assume  not,  a.;d  fix  e,  s  exhibiting 
the  contrary.  Choose  e*  below  e  to  be  a  lowest 
edge  for  which  x  *  s(low(e'))  >0.  Define  a  new 
placement  s',  where  s^lowCe’))  -  0,  s*  (highCe1)  - 
s(high(e'))  +  x,  and  s’(v)  -  s(v)  otherwise. 

We  show  that  expcost(s*)  <expcost(s),  contra¬ 
dicting  the  optimality  of  s. 

By  Theorem  2.2.1,  e*  is  the  only  edge  for 
which  s  and  s'  have  different  expected  absolute 
flows,  so 

expcost(s)  -  expcost(s') 

*  expf  low^ f  ( total (e  ’  ,s) )  -  expflow^,  (total  (e',s’) ) 

*  expflowe,(x)  -  expf low^ , (0) . 

Let  r  “  £  <t(v).  Applying  Theorem  2.2.2*  the 

veB(e') 


Lemma  3.1.2.  For  t  £  N,  til,  sm  -  1, 
0<p  s  1,  it  is  the  case  that 

s 

I  (hp1(l-p)t"i(tp-i)  *  tp(t'1)ps(l-p)t"s. 
i-0 

Proof .  (^)pi(l-p)t  i(tp-i) 

•  tp(J)p1(l-p)f'i  -  t(J‘j)pi(l-p)t"i. 

Since  (J)  «  (*  *)  +  ^  J)  »  this  expression  is  in 
turn  equal  to 

tP(ti1)p1(1“P)t_1  +  tp(^)p1(l-p‘)t_1 

.,t-l.  i,.  ,t-i 
-  t(il)p  (l-p> 


difference  is 

l  (hri(l-r)t"i(lx-ll  -  i) 
i-0 

>  x(l-r) 1  +  l  (hri(l-r)t'i(-x) 
i-1 

-  x(2(l-r)t  -  1). 

Since  (1-r)  C  2  (1-p)  C  > ‘j.  the  difference  >  0, 
giving  the  needed  contradiction.  □ 

3.  BOUNDS  ON  THE  COSTS  OF  OPTIMAL  PLACEMENTS 

In  this  section,  we  assume  that  T  is  a  com¬ 
plete  (balanced)  binary  tree  (except  for 
rootedge(T))  with  leaves  L,  and  let  n  *  I  L  I. 
Assume  $  is  arbitrary,  except  that  as  always 
$(root)  *  0.  We  show  that  optimal  placements  are 
much  better  than  centralized  placements;  in  fact, 
their  expected  cost  is  linear  in  the  number  of 
leaves  of  the  tree. 

3.1.  Cost  of  Exact  Fair-Share  Placements 


_  ,t-lv  i,.  vt-i  _  ,t-l,  i-1/,  vt-u-P 

tp(  i  )p  (1-p)  -  tp(i-i^p  C 1— p> 


Thus  l  (bp1(l-p)t"1(tp-i> 
i-0 


(pHl-p^tp  +  tp 


V  /t-lv  i/ ,  v  t-i 

I  (  ,  )p  u-p) 
i-1 


r  ,t-l.  i-1/.  , t- (i-1) 

~  L  i>P  d“P> 

i-1  1 


tp(l-p)  +  tp 
s-1 


l  (^brhi-p)*-1 

i  =  1 


-  I  (t71)pi U”P^t-i 


i-0 


tp 


( 1-p) c  +  tp[~(ts1)pS(l-p^t  s  -  (l(^l)r°i  l-|Ol  | 

L  j 


tpC^5 )ps  ( l-p) 1  s. 


Since  we  do  not  have  a  direct  characteriza¬ 
tion  of  optimal  placements,  we  instead  bound  the 
expected  cost  of  an  arbitrary  exact  fair-share 
placement.  This  upper  bound,  of  course,  provides 
an  upper  bound  for  optimal  placements.  Moreover, 
bv  Theorem  2.4.3,  the  costs  cannot  differ  by  more 
than  2n. 


Lemma  3.1.  3.  If  t  t  N,  t  1 ,  0  ■  p  •  1  ,  then 


l  ( *  )?* ( 1-p) 1 1 tp- i | 
i-0 


2tp( 


t-1 

LtpJ 


)  P 


L^Ju-r 
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Proof.  7  (‘)pi(l-p)t'i|tp-l| 

i-0 

-  ^  (bpi(l-p)t_1(tp-i) 
i-0 

-  I  (Spi(l-p)t_i(tp-i) 

i-LtpJ+i 

-  2  ^f^)Pi(1"P,t~i<tp"1) 


-  I  (‘)pi(l-p)t  i(tp-i) 

-  2cp(^)p^P-J(X-P)t"LtpJ 

-  \  (J)pia-p)t'i(tP-i) 
i-0  1 

by  Lemma  3.1.2.  But 

[  (,)pl(l-p)t_i(cp-i) 

i-0 

-  I  <J)PlU-p)t_iep-  I  (J)p1(l-l»>t"il 


tP  £  (J)p1a-p)t'i 
i-o 

r  t~lv  i-1  /  .  \  (t-l)-(i-l)_  *  _  rj 

-  tp  1  (i_i)p  (1-p)  -  cp  -  tp  -  u  . 

1=1 


Lemma  3.1.  A.  For  teN,  t>l,  0<p<l, 
tp  >  1,  it  is  the  case  that 


Therefore, 


tp  LtpJ 

*  (^r)  * 


ltpj  (t-LtpJ) 


2irt  j_tpj 


£d.(1.±.,.,  -£Jvrw.(JE-,w 

,  (1  +  4t)  l.tpj  • 


ltpj  _  i  i  L^J  ,  N 


t-.M.  <  i 


«nd  (r^r)  u 


The  lemma  follows. 


Lemma  3.1.5.  Let  T,  t,  t  be  as  in  Theorem 

3.1.1.  Let  e  £  E  and  let  p-  J  *(v) .  Then 
, —  veB(e) 

expf low  (tp)  <  6« vtp  . 


Proof.  If  p  *  0  or  p  =  1 ,  then 
expf  low  (tp)  *=  0.  Assume  0  ^  p  ^  1.  By 

Theorem  2.2.2  and  Lemma  3.1.3, 

expflow  (tp)  =  \  ( Jp  (1-P)  I  tp— i  ! 

e  1=0  1 

=  2tp(-|j^-)  <i-p)t~i-tp3pLtpJ 


Proof.  A  version  of  Stirling's  formula  says 
that  for  all  ni  N,  nM,  it  is  the  case  that 

(“)n/2rrn  <  n!  *  (^)n/I^n  (  1  +  ~  )  . 

tp  <■  t  so  [jtpJ  t  -  1  and  t  -  [tpj  1. 


If  tp  i  1,  Lemma  3.1.4  shows  this  is  at  most 

2tp  *  (1  +  rr)  — ~ 


s  e  /  'jl’j 

tp  •  (t>  *  ~~ —  *  - 

*2»  [tpj 


,  t-i .  t-LpJ  ,  t  , 

W  ~  W 

M#’  (1+fe) 

*  t 


( ,'2~.  [_cpj  ( t- |_tpj )  •  Ltpj  ^  (t-LpJ 


S  e 

.  2  .  2  .  (4)  *  —  *  ‘tr 


6**'tp  . 

On  the  other  hand,  if  tp  ■  1,  then 

2tp.(|t-,|Hl-p)t'LtP-JPL,PJ‘2tr.(l-P*t. 


In  any  case, 


expflow^ (tp)  •  n*»tp  . 


Proof  of  Theorem  3.1.1.  Define  a  function 
G:  N*R*"+  R+  recursively: 


G(0,  u)  *  6  •  •'u  ; 

G(k,  u)  -  max  {G(k-lf u^) + G(k-l,u2) 

+  6  •  »/u  |  u^  +  u^^u}  ,  k>l  . 


get 

G(k,a  •  2k)  =  2  •  G(k-1,  a  •  2k-1)  +  6  •  2k/2  •  /T 

k 

Now,  divide  both  sides  bv  2  and  express  in  terms 
of  G',  where  G'(k,  a)  = G(k,  a  •  2k)/2k,  to  get 

G'  (k,  a)  =G'(k-l,  ■ 


We  first  show  by  induction  on  m  that 
expcoste(s)  SG(iog2  tpg) •  Since  s  is  an  exact 

fair-share  placement,  total(e,s)  =  tpfi. 

m*l:  Then  low(e)  is  a  leaf,  so 

expcost^s)  *  expflowe (total (e , s) ) 

<  6  •  i/tp  by  Lemma  3.1.5 
=  G(0,  tpe). 

m>l:  Then  m«2  since  T  is  a  complete 

binary  tree.  Let  e^,  e^  be  the  two  immediate 

descendant  edges  of  e.  Then 

expcoste(s)  * expf low^Ctotal (e , s) ) 

2 

+  [  expcost  (s). 

i-1  ei 


By  induction, 

expcost  (s)^G(k-l,  tp  ),  1-1,  2. 

i  i 


Again  using  Lemma 
expcoste 


3.1.5,  we  have 

2 

(s)  <  6  •  Sif  +  l  G(k-1,  tp  ) 
e  i=l  i 

<G(k,  tpe)  *  G(log2m,  tpe> 


tp  +  tpo  £  tpo 


Also , 


G'  (0,  a)  =  G(0 ,  a)  =  6  •  Sa  . 
Hence , 


G ' (k,  a)  =  l 


6  •  >/a 

~TUT 


*  6  •  Jl  l  2 

i=0 


-i/2 


i=0  2 

The  series  is  convergent,  so  let  c-6  •  J 


i=0 

Then  Gf(k,a)  .  Substituting  back,  we 

G (k,  u)  -  2k  •  G ’  (k,  u/2k)Sc*2k/2*.u 
as  desired. 


get 


We  conclude  that 

expcost^Cs)  ^GClog^m,  tp^)  Sc*  ^rntp^  . 

Corollary  3.1.6.  Let  T,  $,  L,  n,  t  be 

as  in  Theorem  3.1.1.  Let  s  :  V  -+  R+  be  an  exact 
fair-share  placement  for  T,  <f>,  t,  and  let 
sT:  V -►  N  be  a  fair  whole  placement  for  T,  c,  t. 
Then 

expcost(s)  S  c’v'nt 

and 

expcost (s  * )  <  c • ^nt  +  2n  -  1. 

Proof .  The  first  bound  comes  from  a  direct 
application  of  Theorem  3.1.1  to  rootedge(T).  The 
second  bound  follows  from  the  first  using  Theorem 
2. A. 3  and  that  fact  that  an  n-leaf  binary  tree 
has  at  most  2n-l  edges  (including  rootedge(T) ) . 


We  complete  the  proof  of  the  theorem  by  show¬ 
ing  that 

G(k,  u)  <  c  •  2^2  . 
for  some  constant  c. 


H 

3.2.  Cost  of  Centralized  Placement 

s  is  a  centralized  placement  if  s(v)  *  0 
everywhere  except  at  low(rootedge (T)),  and 
s  (low(rootedge(T)))=  t .  For  such  an  s, 


First  observe  that  for  each  keN,  G(k,  u)  , 
regarded  as  a  function  of  u,  is  concave  and  mono- 
tonically  increasing.  This  is  clearly  true  for 
k“0.  For  k>0,  we  know  inductively  that 
G(k-1,  u)  is  concave  and  mono t on i rally  increasing. 
Hence,  G(k*l,u^)  +  G(k-l,u9)  5  2  •  G(k-1,  (ui+u2)/2) 

<  2  •  G(k-1,  u/2)  whenever  u^ +  s  u,  so 

(*)  G(k,  u)  -  2  •  G(k-1,  u/2)  +  6  •  v'JI  , 
and  G(k,  u)  is  concave  and  increasing. 


expcost(s)  *  t  log^Cn). 

Note  that  the  ratio  of  expected  cost  for  a 
centralized  placement  to  a  fair  whole  placement  is 
at  least 

t  log^Cn) 

c  y'T  +  2n 

When  t  is  small  relative  to  n,  then  the  cen¬ 
tralized  placement  is  superior,  for  reasons 
similar  to  those  discussed  in  Section  2.5. 


We  proceed  to  solve  the  recurrence  equation 
k 

(*).  First,  substitute  a  •  2  for  u  in  (*)  to 
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However,  for  t =  Q(n) ,  the  fair  whole  placement 
is  better  by  a  factor  of  ^(log^n),  and  for 

t  »  n,  the  ratio  approaches  /t,  that  is,  the 
centralized  placement  is  worse  by  a  square. 

Note  also  that  for  t*ft(n),  the  fair  whole 
placement  has  linear  expected  cost,  so  the  expected 
cost  per  request  is  a  constant,  whereas  the  cost 
per  request  for  a  centralized  placement  is  log^n. 

4.  OPTIMAL  PLACEMENTS  FOR  SPECIAL  DISTRIBUTION 

In  this  section,  we  give  several  characteriza¬ 
tion  results  and  algorithms  for  optimal  placements 
for  a  further  restriction  of  the  case  considered  in 
Section  3  in  which  we  assume  $  is  the  same  on  all 
leaves  and  zero  elsewhere.  Specifically,  we  assume 
the  following  for  this  section: 

(a)  T  *  (V,  E)  is  a  complete  binary  tree 

with  leaves  L£V,  n=  |l|,  except  that,  as 
before,  the  root  has  a  single  emanating  edge, 
rootedge(T) . 

(b)  <p(v)  =  1/ n  for  all  vcL,  and  <p(  =0  for 

all  veV-L. 


Let  T  ■  (V  Et)  denote  the  tree  consis^iiig 

of  the  vertices  of  T  at  levels  from  0  to 

riog2t1  +  1  inclusive  and  the  edges  of  T  between 

them.  The  leaves  L^  of  T  are  the  vertices  at 

level  [iogjtl+l.  and  nt  —  | L  |  •  Let  <|>t  be 

defined  on  the  vertices  of  T  by  6  (v)  =  1/n 

t  J  t  t 

for  all  v  e  Lt  and  <f>t(v)  =  0  for  veV^-L^  If 
s:  V R+  is  such  that  s(v)  *  0  for  all  vcV  at 
levels  below  riog2t~|+l,  then  s^:  V^_  -►  R+  can 

be  defined  from  s  by  simply  ignoring  the  missis 
vertices.  Then  it  is  easy  to  see  that 

i'T'J  /O'  \ 

expcostv  }  (s)  =  expcost^  t}  (s^)  +  t  •  (log2n  -  Tlog^tl). 

(Superscripts  distinguish  the  trees  under  consi¬ 
deration.)  We  then  have  the  following. 

(T) 

Theorem  4.1.2.  If  t  > 1,  then  minexpcost 

_  /  *T1  \ 

=  minexpcost''  t  +t  •  (log^n-  I”log0t~|). 

Proof .  (>)  follows  from  Theorem  4.1.1  and 

the  remarks. 


(c)  teN,  t>l. 

We  define  the  level  of  a  vertex  to  be  its 
distance  from  the  root.  By  our  conventions  regard¬ 
ing  rootedge(T),  the  leaves  are  at  level 
log2n+  1. 

For  the  special  case  being  considered  in  this 
section,  certain  of  the  relevant  definitions  can 
be  generalized  to  reflect  the  symmetry  in  the  tree. 
For  example,  if  e^  and  are  edges  with 

high(e^)  and  high(e2)  at  the  same  level  k, 

then  expflow  =  expflow  .  Therefore,  we  write 
el  e2 

expflow^  in  place  of  either.  Similarly,  we  write 
minexpcost^  for  minexpcost^,  where  k  is  the 
level  of  high(e). 

4.1.  A  Bound  on  Levels  with  Nonzero  Placements 

Theorem  2.5.1  can  be  used  to  bound  the  level 
at  which  nonzero  placement  can  occur  in  an  optimal 
placement . 

Theorem  4.1.1.  Let  v  e  V  be  at  level 
k£  flog2tl  +2.  Let  s  be  optimal  for  t. 

Then  s(v)  =  0. 

Proof .  By  Theorem  2.5.1,  it  suffices  to 
show  that  (1-  - - - )t  >  -|  for  t  £  N.  It 

2riog2t  +  fi 

also  suffices  to  consider  t  a  power  of  2.  In 
this  case,  the  inequality  is  just  (l-yj')11^  » 
which  follows  by  an  easy  induction  on  log2t.  0 


(<)  follows  because  any  placement  for  T^ 

can  be  augmented  to  a  placement  for  T  by  placing 
zeros  on  the  additional  nodes,  thereby  incurring 
the  stated  cost.  [] 

Thus,  in  the  remainder  of  this  section,  we 
assume  that  the  maximum  level  in  T  is  at  most 

flog2tl+l  (that  is,  n<2t).  The  reader  can 

then  use  Theorem  4.1.2  to  infer  corresponding 
results  about  cases  where  n£2t. 

Example  4.1.1.  The  case  where  t  = 1  is 
somewhat  peculiar.  The  reader  can  verify  that  for 
all  T  with  maximum  level  number  at  least  2,  the 
following  pictures  all  represent  optimal  placements. 


levels :  0 

:  A  A  A 


That  is,  in  the  first  case, 

s(v)  =  j  1  for  v  the  son  of  the  root, 
i  0  otherwise , 

while  in  the  other  two  cases, 
s(v)  =  {  1 


for  v  the  left  (resp.  right) 
grandson  of  the  root, 

otherwise . 
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This  is  the  only  value  of  t  for  which  an  optimal 
placement  can  have  nonzero  values  below  level 

flog  t]  +  1. 

v 

Example  4.1.2.  If  t  *  2  and  n  >  t,  then 
the  placement  with  one  resource  on  each  of  the  t 
vertices  at  level  k  +  1  is  optimal:  nothing  is 
placed  below  this  level,  and  an  optimal  placement 
for  the  whole  tree  results  from  an  optimal  place¬ 
ment  within  levels  up  to  log  t  +  1.  But  clearly 
putting  one  resource  across  all  leaves  of  is 

optimal,  as  seen  in  Example  2.4.2.  L 


4.2.  Algorithms  for  Finding  Optimal  Placements 

In  this  section,  we  prove  two  sharper  versions 
of  Theorem  2.2.4  for  the  special  case  of  this 
section,  and  use  them  as  bases  for  two  algorithms 
for  finding  optimal  placements. 

Theorem  4.2,1.  Let  t  e  N,  u  e  R+  and  keN, 
k  £ log2n  -  1.  Then  minexpcost^(u)  -  expflow^(u) 

+  2  *  min  { mi  nexp  cos  tk+1  (u' )  |  u  ’  e  {0, 1,  * . . ,  JjJ ,  j  }}. 

Proof.  £  is  clear.  We  show  >,  Write  f  for 
minexpcostk+^.  Consider  any  u^,  u2  e  R+  with 
u  +u2su  and  f(u^)  +  f(u2)  minimal.  We  will 

produce  uf  €  (0,1, . . . ,  |yj ,  }  with  2f(u') 

<  f(ux)  +  f(u2). 

By  Theorem  2.2.5,  there  exists  r e  N  such  that 

r  is  the  smallest  element  of  R+  with  f(r)  £f(s) 
for  all  s >  r.  We  consider  two  cases. 


Case  1.  u  2  2r. 


Choose  u’=r.  u'  €{0,1,.. 

and 

2f(u’)  <  f(ux)  +  f(u2). 

Case  2.  u  <  2r. 

Then  u^  +  u2*u.  Choose  u 

Then 

ul+u2 

2f(u’)*2fHy^)  if(Ul)  +  f(u2) 

by  convexity 

An  appeal  to  Theorem  2.2.4 

completes  the 

proof . 

Theorem  4,2.2.  For  any  t e N,  there  exists 
s  such  that  s(v)*s(v')  for  all  pairs  of  ver¬ 
tices  v,  v'  at  the  same  level,  which  is  optimal 
for  t . 


During  the  algorithm,  one  must  calculate 
expf  loWg(t)  ,  expflow1  (-|) , . . .  ,expf  lowlQg(n)  (~)  .  In 
addition,  one  must  calculate  expflow^(i)  for  all 
ieN,  i  exp  flow  2  (-j)  for  all  ieN,  i 

expflow^Oj)  for  all  ieN,  expf lowiog(n) 

(— )  for  all  ieN,  is-  . 
n  n 

This  is  a  total  of  0(t  log  n)  expflow  computa¬ 
tions.  Since  each  such  computation  involves  0(t) 
arithmetic  operations,  we  have  the  following 
theorem. 

Theorem  4.2.3.  There  is  an  algorithm  using 

2 

0(t  log  n)  arithmetic  operations  which,  for  any 
tree  T  with  n  leaves  satisfying  the  assumptions 
of  this  section  and  for  any  t  €  N,  determines  an 
s  which  is  optimal  for  t  such  that  s(v)  =  s(v') 
for  all  pairs  of  vertices  v,  vf  at  the  same 
level. 

N^te  that  the  bound  of  Theorem  4.2.3  represents 
an  improvement  over  the  bound  in  Theorem  2.3.2 
applied  to  the  special  case  of  this  section;  the 
placements  produced  by  the  two  algorithms  have 
somewhat  different  properties,  however.  The 
remainder  of  this  subsection  deals  with  integral 
placements,  the  situation  considered  in  Theorem 
2.3.2. 


Theorem  4.2.4.  Let  t,  u,  k  c  N ,  k£log0n-l. 


Then 

minexpcostk(u)  *  expflow^Cu)  +  min {minexpeost^^ 
+  minexpcostk+1(u2)  |  ,u7  e  N  ,u^  +  \i.y  <  u, 

and  I  u2  **  u|  I  -  1 )  • 


(V 


Proof.  Again,  we  show  Write  f  for 
minexpcostk+^ .  By  Theorem  4.2.1,  there  is  a  value 

u*  €  {0, 1, . . . , |yj  ,  such  that  minexpcost^Cu)- 
expflowk(u)  +  2f (u’ ) .  If  u*c{0, . . . ,  Jjj }#  then  we  sim¬ 
ply  take  u^  *=  u2  =  u ' .  If  u’  /  N,  then  we 

take  ui  “  |jJ  and  u2  *  ffj ;  in  this  case» 

piecewise  linearity  of  f  guarantees  the  required 
properties.  □ 

Theorem  4.2.5.  For  any  t  c  N,  there  exists 
s:  V  •*>  N  which  is  optimal  for  t.  Moreover,  s 
has  the  additional  properties: 


Proof .  s  can  be  found  by  a  recursive 
algorithm  based  on  Theorem  4.2.1.  □ 

We  proceed  as  before  to  analyze  the  cost  of 
finding  the  placement  of  Theorem  4.2.2.  During 


(a)  if  and  e2  are  two  edges  with  high(ej) 

“highle^),  then  |  total  (e^  ,s)  -  total  (e2  ,s)  |  <  1 , 

(b)  if  e  is  any  edge  with  high(e)  at  level  k, 
then  total(e,s)  s  p— "j  . 
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Proof.  Theorem  4,2.4  leads  naturally  to  a 
recursive  algo’  Ithm  yielding  a  placement  in  V+N 
and  obviously  satisfying  (a).  To  see  that  (b) 
is  also  satisfied,  assume  the  contrary,  and  let 
e  be  a  highest  edge  in  the  tree  with 


total(e,s)  > 


-“j ,  where  high(e)  is  at  level  k; 


Since  total (e,s)  € N ,  we  have  total(e,s)£ 


+  1 


r+1.  e  is  not  rootedge(T),  so  let  e1  be 

2 

the  edge  ^  e  with  high(e)  =  high(e'),  and  let 
e"  be  the  edge  immediately  above  e  in  T.  Then 


total  (e"  ,s)  <  I-  C 


k-1 


.k-1 


+  1. 


Therefore, 

total(e's)  <  total(eM,s)  -  total(e.s) 


But  then 

|  total  (e,s)  -  total  (e  * ,  s)  |  >1, 

contradicting  property  (a) .  □ 

Once  again,  we  analyze  the  cost  of  determin¬ 
ing  an  optimal  placement  with  the  properties  of 
Theorem  4.2.5.  One  must  calculate  expflow^(t), 

and  also  expflow^(i)  for  all  i  e  N,  is  ft/2*]  , 

expflow2(i)  for  all  icN,  i  £  ft/*!  *  •  •  • » 

expf  low^  ^  (i)  for  all  i  e  N,  i  S  ft/n]  .  This  is  a 

total  of  0(t)  expf low  comDutations ,  (since  we 
are  assuming  that  n  is  0(t)). 

Theorem  4.2.6.  There  is  an  algorithm  using 

2 

0(t  )  arithmetic  operations,  which  for  any  tree 
T  and  for  any  t  €  N,  determines  an  s:  V -►  N 
which  is  optimal  for  t,  and  which  satisfies  con¬ 
ditions  (a)  and  (b)  of  Theorem  4.2.5. 

4.3.  A  Fast  Algorithm  for  Determining  Optimal 
Placements 


Proof.  Again,  we  show  Write  f  for 
minexpcost^+^.  By  Theorem  4.2.4,  there  is  a  pair 

u^,  u2  €  N  such  that  u^  +  u^u,  Si 

and  minexpcostjju)  -  expflow^Cu)  +  f(u^)  +  f(u2). 

Of  all  such  pairs,  choose  one  minimizing 
u-(u^  +  u2)  and  assume  u^u^,  We  must  check 

that  Uj,  u2  c{ 


t 

t 

2k+l 

* 

2k+l 

1 1 

r  t  * 

2k+l 

|  ,  then  u2  ^  j 

|2k+1 

r  n 

-  — 

r 

t 

.  Then  u  £  u^  +  u2  £  2 

t 

+  1  > 

1 

JL 

2k+l 

2k+l 

[2k 

a  contradiction .  Thus,  u^  (and  therefore  u^) 


■vk+1 


If  u1  < 


„k+l 


,  then  < 


-k+1 


-  1 ,  so 


u2  S 


,k+l 


Then  u  -  (u^  +  u2>  £ 


-  ( 


,k+l 


-  1  + 


,k+l 


)  >1,  Define  a  new  decomposi¬ 


tion  w^,  w2  by  w1*u^  +  l,  w2«u9.  Then 
Wi4v2<u  and  )w2-Wj|sl.  Now,  expf  low^j  (w^ ) 

S  expf low^^ (u^ )  because  < 

Theorems  2.2.3,  2.4.1  and  2,4.2,  Thus,  f(w^) 

S  f(u  ),  by  Theorem  2.2.4.  Hence,  w^f  w?  also 
satisfies  the  conditions  used  in  choosing  u^,  u0, 

but  u  -  (u^  +  u2)  >  u  -  (w^  +  w9) ,  contradicting  the 
minimality  condition.  □ 


Theorem  4.3.2.  For  the  special  case  of  this 
section,  there  exists  a  fair  whole  placement  s 
which  is  optimal  for  t. 


The  results  of  Section  2.4  can  be  used  to 
prune  the  a Igor i ram's  search  space  still  further, 
leading  to  a  much  faster  algorithm. 


Theorem  4.3.1.  Let  t,  u,  kcN, 


k  £  log (n)  -  1, 


Let 

t, 

t 

<u5 
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Then 


minexpcost^(u)  *  expf low^(u) 


+  min{minexpcost^+^(u^)  +  minexpcost^^u^  | 


V  V£{ 


t 

t 

~k+l 

/  -j 

* 

2k+l 

}  and  u| +  u2  ^  u} . 


Proof .  Theorem  4,3.1  yields  a  recursive 
algorithm.  Q 


Note  that  a  result  similar  to  Theorem  4.3.2 
does  not  hold  in  the  general  case  —  recall 
Example  2.4.1. 


The  algorithm  resulting  from  Theorem  4.3.2  is 
extremely  fast.  Namely,  one  must  calculate 


expf low^C i ) 


for 


for  each  k, 


1  <k  slog(n),  for  a  total  of  only  0(log  n) 
expf low  computations. 


Theorem  A. 3. 3.  There  is  an  algorithm  using 
0(t  log  n)  arithmetic  operations  which  any 
tree  T  and  for  any  t  e  N,  determines  a  fair 
whole  placement  s  which  is  optimal  for  t. 


Example 


Some  of  the  optimal  placements  discovered  by 
our  algorithms  are  rather  unexpected.  For  example, 
the  following  represents  an  optimal  placement  of 
11  resources  in  a  balanced  binary  tree  with 
uniform  probability  distribution: 
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